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Universality in the behavior of complex systems 
often reveals itself in the form of scale-invariant 
distributions that are essentially independent of 
the details of the microscopic dynamics. A rep- 
resentative paradigm of complex behavior in na- 
ture is cooperative evolution. The interaction of 
individuals gives rise to a wide variety of collec- 
tive phenomena that strongly differ from indi- 
vidual dynamics — such as demographic evolution, 
cultural and technological development, and eco- 
nomic activity. A striking example of such co- 
operative phenomena is the formation of urban 
aggregates which, in turn, can be considered 
to cooperate in giving rise to nations. We find 
that population and area distributions of nations 
follow an inverse power-law behavior, as is known 
for cities |[|-|5|]. The exponents, however, are rad- 
ically different in the two cases (/x ~ 1 for nations, 
jit » 2 for cities). We interpret these findings by 
developing growth models for cities and for na- 
tions related to basic properties of partition of 
the plane. These models allow one to understand 
the empirical findings without resort to the intro- 
duction of complex socio-economic factors. 

The way in which urban aggregates are distributed was 
first investigated by Zipf || who, half a century ago, ob- 
served that the population distribution of cities follows a 
power-law behavior with exponent fj, ks 2. This Zipf law 
has a "universal" character since it holds at the world 
level as well as within a single nation, and the exponent 
is essentially independent of the area of the nation and its 
socio-economical conditions. More recently it has been 
observed 0|| that the area distribution of satellite cities, 
towns and villages around huge urban centers also obeys 
a power-law with exponent fi ~ 2. 

The remarkable universal result n » 2 has very re- 
cently attracted the attention of a number of physi- 
cists who model urban growth processes ||-||- Makse 
et al. j|,[3), using a correlated percolation model in the 
presence of a density gradient, reproduced the observed 
morphology of cities and the area distribution of sub- 
clusters in an urban system. Zanette and Manrubia || 
proposed a stochastic model which generates intermittent 



spatiotemporal structures, and predicts a population dis- 
tribution in agreement with that observed empirically. 
Very recently Marsili and Zhang (]]] proposed a model 
based on a master equation approach which is able to 
give a population distribution close to that obeyed by 
cities. The transition probabilities which enter the mas- 
ter equation were related to some estimate of the interac- 
tions among individuals living in the same city. Though 
the concept of interaction among human beings is not so 
clearly defined as that among particles, it is nevertheless 
true that individuals living in the same city are related 
to each other by a number of "links" which in the end 
define the very concept of city. 

People living within the same nation are also related to 
each other — e.g., they share language and cultural her- 
itage. Since the average "interactions" among the inhab- 
itants of a nation may differ from those among people 
living in the same city, one could ask if the distribution 
of nations obeys the same law as that of cities. In order 
to answer to this question we analysed the population 
and area distributions of the world's nations. The log- 
log plot of the population distribution f(P) is shown in 
Fig. a for all nations of the world. In the same figure we 
show also the population distribution for the 140 largest 
city agglomerates of the USA. Both distributions obey 
a power law dependence, f(P) ~ P~^ p . Regression fits 
give hp = 0.97 for nations, and \ip = 1.94 for cities. Fig- 
ure b shows a log- log plot of the area distribution f(A) 
for all nations of the world as well as for nations belong- 
ing to a subset of all nations (Europe). In both cases 
f(A) obeys a power law dependence, f(A) ~ A~>* A with 
exponent close to unity. 

The strikingly different values of the exponent for cities 
and nations suggest fundamental differences in the histor- 
ical and social processes that lead to such distributions. 
Here we propose a simple explanation of the distributions 
of nations and cities based only on geometrical consider- 
ations. We observe that, at least in principle, there is no 
restriction to the land accessible to a nation except that 
of the total existing land: a sufficiently powerful nation 
could expand to absorb all other nations. Cities, be- 
ing the result of spontaneous aggregation of individuals 
around sites having attractive features, can form away 
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from existing ones. This separates the plane region into 
land basins, and each new city spans a single basin. The 
resulting distribution of areas is not destroyed when a 
city expands to absorb nearby cities and gives rise to a 
compact urban aggregate. In fact, unlike nations, cities 
usually do not lose their land to neighbours: small towns 
and villages retain their identity and usually become ad- 
ministrative districts of the bigger aggregate (obeying, as 
shown in Ref. , the same area distribution that holds 
for separate cities). Thus a major difference in the way 
according to which land is occupied by cities and nations 
is that for cities the accessible land is fragmented into 
basins while it is not for nations. 

Next we model the land occupation processes of na- 
tions and cities. We first note that the above consid- 
erations suggest that geometric properties may be be- 
hind the differences between city and nation distribu- 
tions. Hence we analyse these processes as random par- 
titioning^ of the plane. Since different nations (or cities) 
do not all form at the same time, we consider partition 
processes in which the different portions are sequentially 
selected ||£o). 

One of the simplest ways of partitioning a plane is to 
divide it using straight lines that are randomly oriented 
and positioned. Each line divides the region into two 
portions, of which the smaller is selected and the larger 
is further partitioned. This process is close in spirit to 
the way in which land is occupied by nations. Because 
this partition model resembles the positioning of fences, 
we refer to it as fence model. 

This new model can be solved analytically. After n 
partitions, the land "available" for further division is A n , 
and A n — r n A n -\, where r n is a random factor uni- 
formly distributed between 1/2 and 1. The area T n of the 
"taken" portion is T n = (1 — r n )A n -x, and its logarithm 
can be written as In T n — X)"=i hir-i+hi(l— r n )+ln Aq. If 
we plot on the x axis the values of lnT„, we get random 
points, the average distance between which is equal to 
— (lnrj) = 1 — In 2 w 0.3. The largest nation corresponds 
to almost an entire continent Ao/2£, and the smallest one 
corresponds to Ao(l/2£)e~°- 3n , where n is the number of 
nations on the continent, and the factor 2£ corresponds 
to the average (ln(l — 7-4) « —1.7. Thus, the distribution 
of the logarithm of a nation's area in each continent is 
a flat distribution between IiiAq — 0.3n -17 and In A^ 1 ' 7 
described by the probability density P(lnS') = 1/0. 3n. 

The world has 5 continents; some, such as Australia 
and North America, have very few nations and some, 
such as Europe and Africa, have many. Because there 
are, on average, approximately n — 50 nations per conti- 
nent, we can expect the approximate distribution of the 
logarithms of the number of nations to be uniformly dis- 
tributed between the average largest country ~ 10 3 km 2 
and 10 7 • e -15 ~ 3km 2 , which is consistent with a spread 
observed in the distribution of real nations. The flat dis- 



tribution of the logarithms const c?(log S) corresponds to 
the distribution (const/ S) dS of the areas, which is close 
to what we observe in Fig. lb. Note, however, that, due 
to the few nations in each bin, a particular realization of 
the partition process described above may significantly 
deviate from the flat distribution we would expect. 

The above model could be, however, oversimplified. 
This concern can be alleviated by incorporating into it 
the possibility that nations can evolve, growing or shrink- 
ing. Consider, e. g., the variant of the fence model 
(called, in the following, evolution model) in which with 
probability 1/2 a nation can grow or shrink by some given 
amount. The P(A) histogram, remarkably, is not af- 
fected. To see this, we first note that such change in area 
corresponds to the variable \og(A) increasing or decreas- 
ing by a constant number, i.e., a simple random walk in 
the variable \og(A). As we mention above, P(A) ~ 1/A 
is equivalent to P[\og(A)} ~ const. 

In the present case, the random walk is confined by 
reflecting boundaries: A max = Aq is some minimal size 
^4-min below which the nation is not stable. The distri- 
bution of a random walk confined in an interval with 
reflecting boundaries converges to a uniform distribu- 
tion p| . Thus P[log(A)} converges to a constant and 
hence P(A) ~ 1/A. The the P(A) distribution is immune 
to the "noise" of growth and shrinking. The distribution 
of nation areas, simulated using the evolution model, is 
shown in Fig. lc. The distribution of the logarithms of 
the nation areas are shown in Fig. Id. 

To model city distributions, we shall consider a radi- 
cally different way of partitioning the plane. As suggested 
by the way urban geographers have thought about "cen- 
tral place" theory and the hierarchy of towns (l^,|l^l , we 
assume that cities have on the average a circular shape 
and thus can be approximately represented as circles. 
Land occupation by cities can thus be modeled through 
the partition of the plane in nonoverlapping circles. Each 
new portion is a circle (with radius chosen from a uniform 
distribution) at a randomly-chosen position, but outside 
of previously-selected circles. The maximal area of each 
new circle is limited by the distance from the closest ex- 
isting circle. The resulting fragmentation of accessible 
land reduces the space available for the next circle more 
rapidly with respect to what happens for the portions 
generated through the fence model, thus making small 
portions much more probable. Hence we expect [ia to be 
larger for the present model. 

We simulate the circle model and find that the distri- 
bution of the circle areas follows a power-law behavior 
which is close to the empirical data of Fig. la. A linear 
fit of the distribution (with the exclusion of the region 
affected by finite size effects) gives an exponent /1 ks 1.94 
(Fig.le). One advantage of the circle model, compared 
to the previous models [Q-Q, is that it is based only on 
the geometrical features of the land occupation process. 

Another advantage of this model is that it can be 
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solved analytically in the limit when the area of the 
newly-formed circle is a small number proportional to 
the area of the unoccupied land closest to the center 
of this circle, with proportionality coefficient k <C 1. 
Suppose that there are N circles in a region of to- 
tal area Aq. Then the probability that a randomly 
chosen point is surrounded by an empty space of area 
larger than X is governed by a Poisson distribution 
F(X,N) = exp(-XN/A ). The number of circles 
with area between A and A + dA is p(A, N)dA = 
N/(A k)exp(-AN/A k)dA, the derivative of the Pois- 
son distribution function. The distribution of circles, 
P(A), at some given instant when there are N c circles 
is given by the integral J^ c p(A,N)dN = (kA /A 2 )[l - 
exp(-AN c /A k)]. For large N c , we recover P(A) — A~ 2 , 
so [ia — 2, in agreement with empirical findings. It is in- 
teresting to note that the exponent [ia — 2 is robust 
since, as our simulations show, it holds even in the case 
when k is not a small number, but is any random num- 
ber constrained only by the fact that two circles cannot 
overlap (as for urban agglomerates that consist of tightly- 
packed villages and towns). 

In summary, we show, through the analysis of accurate 
geographical and demographical data, that the nation 
population and nation area distributions obey power- 
laws. The exponent is, however, surprising and com- 
pletely unexpected. Moreover, its value (1) is quite dif- 
ferent from the known exponent for cities (2). In addition 
to our empirical discovery, we propose an explanation for 
why the population distribution exponent of cities differs 
so strikingly from that of nations. 
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FIG.l. (a) Double-logarithmic plot of the histogram of the 
population, P, of the 255 world nations and 140 largest ur- 
ban agglomerates in the USA in mid-1997. For nations, the 
slope gives /j,p = 0.97 and the linear regression coefficient is 
R = 0.97; for cities ^ P = 1.94 and R = 0.99. For visual 
convenience, city data are multiplied by 10 2 . (b) Double- 
logarithmic plot of the histogram of areas, A, of the 255 na- 
tions of the world and 49 European nations; the correspond- 
ing coefficients are R = 0.99 and 0.99 respectively. Euro- 
pean data are divided by 10 2 . The source for both plots is 
trttp: //www. stats . demon.nl. (c) Double-logarithmic plot of 
the histogram of areas for the evolution model, (d) Distribu- 
tions of logarithms of nations areas produced by (i) 255 na- 
tions and (ii) by the evolution model, (e) Double-logarithmic 
plot of the histogram of areas for the circle model; the linear 
regression coefficient is R = 0.96. 



4 



